AITopics | level model

Collaborating Authors

level model

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Reviews: Attend and Predict: Understanding Gene Regulation by Selective Attention on Chromatin

Neural Information Processing SystemsOct-8-2024, 10:07:17 GMT

The paper presents a novel method for predicting gene regulation by LSTM with an attention mechanism. The model consists of two levels, where the first level is applied on bins for each histone modifications (HM) and the second level is applied to multiple HMs. Attention mechanism is used in each level to focus on the important parts of the bins and HMs. In the experiments, the proposed method improves AUC scores over baseline models including CNN, LSTM, and CNN with an attention mechanism. This is an interesting paper which shows that LSTM with an attention mechanism can predict gene regulation.

attention mechanism, gene regulation, selective attention, (10 more...)

Neural Information Processing Systems

Genre: Research Report (0.45)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.87)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

SuperCLUE-Fin: Graded Fine-Grained Analysis of Chinese LLMs on Diverse Financial Tasks and Applications

Xu, Liang, Zhu, Lei, Wu, Yaotong, Xue, Hang

arXiv.org Artificial IntelligenceApr-29-2024

The SuperCLUE-Fin (SC-Fin) benchmark is a pioneering evaluation framework tailored for Chinese-native financial large language models (FLMs). It assesses FLMs across six financial application domains and twenty-five specialized tasks, encompassing theoretical knowledge and practical applications such as compliance, risk management, and investment analysis. Using multi-turn, open-ended conversations that mimic real-life scenarios, SC-Fin measures models on a range of criteria, including accurate financial understanding, logical reasoning, clarity, computational efficiency, business acumen, risk perception, and compliance with Chinese regulations. In a rigorous evaluation involving over a thousand questions, SC-Fin identifies a performance hierarchy where domestic models like GLM-4 and MoonShot-v1-128k outperform others with an A-grade, highlighting the potential for further development in transforming theoretical knowledge into pragmatic financial solutions. This benchmark serves as a critical tool for refining FLMs in the Chinese context, directing improvements in financial knowledge databases, standardizing financial interpretations, and promoting models that prioritize compliance, risk management, and secure practices. We create a contextually relevant and comprehensive benchmark that drives the development of AI in the Chinese financial sector. SC-Fin facilitates the advancement and responsible deployment of FLMs, offering valuable insights for enhancing model performance and usability for both individual and institutional users in the Chinese market..~\footnote{Our benchmark can be found at \url{https://www.CLUEbenchmarks.com}}.

financial information extraction, information extraction, qualification, (14 more...)

arXiv.org Artificial Intelligence

2404.19063

Country: Asia > China (0.06)

Genre: Research Report (0.50)

Industry:

Banking & Finance > Trading (1.00)
Banking & Finance > Financial Services (0.95)
Materials > Metals & Mining > Coal (0.94)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

PLMM: Personal Large Models on Mobile Devices

Gong, Yuanhao

arXiv.org Artificial IntelligenceSep-26-2023

Inspired by Federated Learning, in this paper, we propose personal large models that are distilled from traditional large language models but more adaptive to local users' personal information such as education background and hobbies. We classify the large language models into three levels: the personal level, expert level and traditional level. The personal level models are adaptive to users' personal information. They encrypt the users' input and protect their privacy. The expert level models focus on merging specific knowledge such as finance, IT and art. The traditional models focus on the universal knowledge discovery and upgrading the expert models. In such classifications, the personal models directly interact with the user. For the whole system, the personal models have users' (encrypted) personal information. Moreover, such models must be small enough to be performed on personal computers or mobile devices. Finally, they also have to response in real-time for better user experience and produce high quality results. The proposed personal large models can be applied in a wide range of applications such as language and vision tasks.

gong, language model, p-model, (17 more...)

arXiv.org Artificial Intelligence

2309.14726

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > United Kingdom > England > East Sussex > Brighton (0.04)
Europe > Switzerland > Zürich > Zürich (0.04)
(3 more...)

Genre: Research Report (0.40)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.58)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)

Add feedback

Recognizing Hand-written Digits Using Hierarchical Products of Experts

Neural Information Processing SystemsApr-6-2023, 16:57:44 GMT

The product of experts learning procedure [1] can discover a set of stochastic binary features that constitute a non-linear generative model of handwritten images of digits. The quality of generative models learned in this way can be assessed by learning a separate model for each class of digit and then comparing the unnormalized probabilities of test images under the 10 different class-specific models. To improve discriminative performance, it is helpful to learn a hierarchy of separate models for each digit class. Each model in the hierarchy has one layer of hidden units and the nth level model is trained on data that consists of the activities of the hidden units in the already trained (n - l)th level model. After train(cid:173) ing, each level produces a separate, unnormalized log probabilty score.

generative model, hierarchical product, recognizing hand-written digit, (7 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

All state insurance severity prediction

#artificialintelligenceMar-8-2022, 12:16:13 GMT

This problem statement is from the Kaggle recruitment challenge, by Allstate Insurance. Allstate is an insurance services company in the USA, which provides insurance to over 16 million households in the USA. The company wants to reduce the complexity of the insurance claiming process and make it a worry-free experience for the customers by automating the predictions of claims severity. The Allstate Insurance company wants to reduce the time taking process and make it easier for the people who need insurance cover to claim it much easier. So in order to reduce the complexity, It has given a dataset to use machine learning algorithms to predict the costs and hence the severity of the claims accurately.

dataset, prediction, train and test data, (14 more...)

#artificialintelligence

Country: North America > United States (0.45)

Industry: Banking & Finance > Insurance (0.69)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.73)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.51)

Add feedback

Automate Stacking In Python: How to Boost Your Performance While Saving Time

#artificialintelligenceAug-29-2019, 13:54:48 GMT

Utilizing stacking (stacked generalizations) is a very hot topic when it comes to pushing your machine learning algorithm to new heights. For instance, most if not all winning Kaggle submissions nowadays make use of some form of stacking or a variation of it. First introduced in the 1992 paper Stacked Generalization by David Wolpert, their main purpose is to reduce the generalization error. According to Wolpert, they can be understood "as a more sophisticated version of cross-validation". While Wolpert himself noted at the time that large parts of stacked generalizations are "black art", it seems that building larger and larger stacked generalizations win over smaller stacked generalizations.

artificial intelligence, machine learning, prediction, (14 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Detecting toxic comments with multi-task Deep Learning

@machinelearnbotFeb-20-2018, 18:15:40 GMT

The internet is a bright place, made dark by internet trolls. To help with this issue, a recent Kaggle competition has provided a large number of internet comments, labelled with whether or not they're toxic. The ultimate goal of this competition is to build a model that can detect (and possibly sensor) these toxic comments. While I hope to be an altruistic person, I'm actually more interested in using the free, large, and hand-labeled text data set to compare LSTM powered architectures and deep learning heuristics. So, I guess I get to hunt trolls while providing a casestudy in text modeling.

artificial intelligence, level model, machine learning, (11 more...)

@machinelearnbot

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

ClassSim: Similarity between Classes Defined by Misclassification Ratios of Trained Classifiers

Arino, Kazuma, Kikuta, Yohei

arXiv.org Machine LearningFeb-5-2018

Deep neural networks (DNNs) have achieved exceptional performances in many tasks, particularly, in supervised classification tasks. However, achievements with supervised classification tasks are based on large datasets with well-separated classes. Typically, real-world applications involve wild datasets that include similar classes; thus, evaluating similarities between classes and understanding relations among classes are important. To address this issue, a similarity metric, ClassSim, based on the misclassification ratios of trained DNNs is proposed herein. We conducted image recognition experiments to demonstrate that the proposed method provides better similarities compared with existing methods and is useful for classification problems. Source code including all experimental results is available at https://github.com/karino2/ClassSim/.

artificial intelligence, machine learning, pattern recognition, (19 more...)

arXiv.org Machine Learning

1802.01267

Genre: Research Report > New Finding (0.68)

Industry: Aerospace & Defense (0.30)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition > Image Matching (0.34)

Add feedback

An Effective Way to Improve YouTube-8M Classification Accuracy in Google Cloud Platform

Zhong, Zhenzhen, Huang, Shujiao, Zhan, Cheng, Zhang, Licheng, Xiao, Zhiwei, Wang, Chang-Chun, Yang, Pei

arXiv.org Machine LearningJun-25-2017

Large-scale datasets have played a significant role in progress of neural network and deep learning areas. YouTube-8M is such a benchmark dataset for general multi-label video classification. It was created from over 7 million YouTube videos (450,000 hours of video) and includes video labels from a vocabulary of 4716 classes (3.4 labels/video on average). It also comes with pre-extracted audio & visual features from every second of video (3.2 billion feature vectors in total). Google cloud recently released the datasets and organized 'Google Cloud & YouTube-8M Video Understanding Challenge' on Kaggle. Competitors are challenged to develop classification algorithms that assign video-level labels using the new and improved Youtube-8M V2 dataset. Inspired by the competition, we started exploration of audio understanding and classification using deep learning algorithms and ensemble methods. We built several baseline predictions according to the benchmark paper and public github tensorflow code. Furthermore, we improved global prediction accuracy (GAP) from base level 77% to 80.7% through approaches of ensemble.

dataset, logistic model, prediction, (15 more...)

arXiv.org Machine Learning

1706.08217

Country: Asia > Middle East > Jordan (0.05)

Genre: Research Report (0.67)

Industry: Information Technology > Services (1.00)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Deep Learning Methods for Efficient Large Scale Video Labeling

Skalic, Miha, Pekalski, Marcin, Pan, Xingguo E.

arXiv.org Machine LearningJun-14-2017

We present a solution to "Google Cloud and YouTube-8M Video Understanding Challenge" that ranked 5th place. The proposed model is an ensemble of three model families, two frame level and one video level. The training was performed on augmented dataset, with cross validation.

artificial intelligence, machine learning, prediction, (20 more...)

arXiv.org Machine Learning

1706.04572

Genre: Research Report (0.64)

Industry: Information Technology (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback